10 research outputs found

    Privacy-Aware Recommender Systems Challenge on Twitter's Home Timeline

    Full text link
    Recommender systems constitute the core engine of most social network platforms nowadays, aiming to maximize user satisfaction along with other key business objectives. Twitter is no exception. Despite the fact that Twitter data has been extensively used to understand socioeconomic and political phenomena and user behaviour, the implicit feedback provided by users on Tweets through their engagements on the Home Timeline has only been explored to a limited extent. At the same time, there is a lack of large-scale public social network datasets that would enable the scientific community to both benchmark and build more powerful and comprehensive models that tailor content to user interests. By releasing an original dataset of 160 million Tweets along with engagement information, Twitter aims to address exactly that. During this release, special attention is drawn on maintaining compliance with existing privacy laws. Apart from user privacy, this paper touches on the key challenges faced by researchers and professionals striving to predict user engagements. It further describes the key aspects of the RecSys 2020 Challenge that was organized by ACM RecSys in partnership with Twitter using this dataset.Comment: 16 pages, 2 table

    Search for dark matter produced in association with bottom or top quarks in √s = 13 TeV pp collisions with the ATLAS detector

    Get PDF
    A search for weakly interacting massive particle dark matter produced in association with bottom or top quarks is presented. Final states containing third-generation quarks and miss- ing transverse momentum are considered. The analysis uses 36.1 fb−1 of proton–proton collision data recorded by the ATLAS experiment at √s = 13 TeV in 2015 and 2016. No significant excess of events above the estimated backgrounds is observed. The results are in- terpreted in the framework of simplified models of spin-0 dark-matter mediators. For colour- neutral spin-0 mediators produced in association with top quarks and decaying into a pair of dark-matter particles, mediator masses below 50 GeV are excluded assuming a dark-matter candidate mass of 1 GeV and unitary couplings. For scalar and pseudoscalar mediators produced in association with bottom quarks, the search sets limits on the production cross- section of 300 times the predicted rate for mediators with masses between 10 and 50 GeV and assuming a dark-matter mass of 1 GeV and unitary coupling. Constraints on colour- charged scalar simplified models are also presented. Assuming a dark-matter particle mass of 35 GeV, mediator particles with mass below 1.1 TeV are excluded for couplings yielding a dark-matter relic density consistent with measurements

    Deep Learning Applied to SEM Images for Supporting Marine Coralline Algae Classification

    No full text
    The classification of coralline algae commonly relies on the morphology of cells and reproductive structures, along with thallus organization, observed through Scanning Electron Microscopy (SEM). Nevertheless, species identification based on morphology often leads to uncertainty, due to their general plasticity. Evolutionary and environmental studies featured coralline algae for their ecological significance in both recent and past Oceans and need to rely on robust taxonomy. Research efforts towards new putative diagnostic tools have recently been focused on cell wall ultrastructure. In this work, we explored a new classification tool for coralline algae, using fine-tuning pretrained Convolutional Neural Networks (CNNs) on SEM images paired to morphological categories, including cell wall ultrastructure. We considered four common Mediterranean species, classified at genus and at the species level (Lithothamnion corallioides, Mesophyllum philippii, Lithophyllum racemus, Lithophyllum pseudoracemus). Our model produced promising results in terms of image classification accuracy given the constraint of a limited dataset and was tested for the identification of two ambiguous samples referred to as L. cf. racemus. Overall, explanatory image analyses suggest a high diagnostic value of calcification patterns, which significantly contributed to class predictions. Thus, CNNs proved to be a valid support to the morphological approach to taxonomy in coralline algae

    Deep Learning Applied to SEM Images for Supporting Marine Coralline Algae Classification

    No full text
    The classification of coralline algae commonly relies on the morphology of cells and reproductive structures, along with thallus organization, observed through Scanning Electron Microscopy (SEM). Nevertheless, species identification based on morphology often leads to uncertainty, due to their general plasticity. Evolutionary and environmental studies featured coralline algae for their ecological significance in both recent and past Oceans and need to rely on robust taxonomy. Research efforts towards new putative diagnostic tools have recently been focused on cell wall ultrastructure. In this work, we explored a new classification tool for coralline algae, using fine-tuning pretrained Convolutional Neural Networks (CNNs) on SEM images paired to morphological categories, including cell wall ultrastructure. We considered four common Mediterranean species, classified at genus and at the species level (Lithothamnion corallioides, Mesophyllum philippii, Lithophyllum racemus, Lithophyllum pseudoracemus). Our model produced promising results in terms of image classification accuracy given the constraint of a limited dataset and was tested for the identification of two ambiguous samples referred to as L. cf. racemus. Overall, explanatory image analyses suggest a high diagnostic value of calcification patterns, which significantly contributed to class predictions. Thus, CNNs proved to be a valid support to the morphological approach to taxonomy in coralline algae

    Development of a Knowledge-Based Expert System for Diagnosing Post-Harvest Diseases of Apple

    No full text
    Post-harvest diseases are one of the main causes of economical losses in the apple fruit production sector. Therefore, this paper presents an application of a knowledge-based expert system to diagnose post-harvest diseases of apple. Specifically, we detail the process of domain knowledge elicitation for constructing a Bayesian network reasoning system. We describe the developed expert system, dubbed BN-DSSApple, and the diagnostic mechanism given the evidence provided by the user, as well as a likelihood evidence method, learned from the estimated consensus of users’ and expert’s interactions, to effectively transfer the performance of the model to different cohorts of users. Finally, we detail a novel technique for explaining the provided diagnosis, thus increasing the trust in the system. We evaluate BN-DSSApple with three different types of user studies, involving real diseased apples, where the ground truth of the target instances was established by microbiological and DNA analysis. The experiments demonstrate the performance differences in the knowledge-based reasoning mechanism due to heterogeneous users interacting with the system under various conditions and the capability of the likelihood-based method to improve the diagnostic performance in different environments

    Non Stationary Multi-Armed Bandit: Empirical Evaluation of a New Concept Drift-Aware Algorithm

    No full text
    The Multi-Armed Bandit (MAB) problem has been extensively studied in order to address real-world challenges related to sequential decision making. In this setting, an agent selects the best action to be performed at time-step t, based on the past rewards received by the environment. This formulation implicitly assumes that the expected payoff for each action is kept stationary by the environment through time. Nevertheless, in many real-world applications this assumption does not hold and the agent has to face a non-stationary environment, that is, with a changing reward distribution. Thus, we present a new MAB algorithm, named f-Discounted-Sliding-Window Thompson Sampling (f-dsw TS), for non-stationary environments, that is, when the data streaming is affected by concept drift. The f-dsw TS algorithm is based on Thompson Sampling (TS) and exploits a discount factor on the reward history and an arm-related sliding window to contrast concept drift in non-stationary environments. We investigate how to combine these two sources of information, namely the discount factor and the sliding window, by means of an aggregation function f(.). In particular, we proposed a pessimistic (f=min), an optimistic (f=max), as well as an averaged (f=mean) version of the f-dsw TS algorithm. A rich set of numerical experiments is performed to evaluate the f-dsw TS algorithm compared to both stationary and non-stationary state-of-the-art TS baselines. We exploited synthetic environments (both randomly-generated and controlled) to test the MAB algorithms under different types of drift, that is, sudden/abrupt, incremental, gradual and increasing/decreasing drift. Furthermore, we adapt four real-world active learning tasks to our framework—a prediction task on crimes in the city of Baltimore, a classification task on insects species, a recommendation task on local web-news, and a time-series analysis on microbial organisms in the tropical air ecosystem. The f-dsw TS approach emerges as the best performing MAB algorithm. At least one of the versions of f-dsw TS performs better than the baselines in synthetic environments, proving the robustness of f-dsw TS under different concept drift types. Moreover, the pessimistic version (f=min) results as the most effective in all real-world tasks

    Booker Prediction From Requests For Quotation Via Machine Learning Techniques

    Get PDF
    Purpose – Many incoming requests for quotation usually compete for the attention of accommodation service provider staff on a daily basis, while some of them might deserve more priority than others. Design – This research is therefore based on the correspondence history of a large booking management system that examines the features of quotation requests from aspiring guests in order to learn and predict their actual booking behavior. Approach – In particular, we investigate the effectiveness of various machine learning techniques for predicting whether a request will turn into a booking by using features such as the length of stay, the number and type of guests, and their country of origin. Furthermore, a deeper analysis of the features involved is performed to quantify their impact on the prediction task. Findings – We based our experimental evaluation on a large dataset of correspondence data collected from 2014 to 2019 from a 4-star hotel in the South Tyrol region of Italy. Numerical experiments were conducted to compare the performance of different classification models against the dataset. The results show a potential business advantage in prioritizing requests for proposals based on our approach. Moreover, it becomes clear that it is necessary to solve the class imbalance problem and develop a proper understanding of the domain-specific features to achieve higher precision/recall for the booking class. The investigation on feature importance also exhibits a ranking of informative features, such as the duration of the stay, the number of days prior to the request, and the source/country of the request, for making accurate booking predictions. Originality of the research – To the best of our knowledge, this is one of the first attempts to apply and systematically harness machine learning techniques to request for quotation data in order to predict whether the request will end up in a booking

    Measurements of ttˉt\bar{t} differential cross-sections of highly boosted top quarks decaying to all-hadronic final states in pppp collisions at s=13\sqrt{s}=13\, TeV using the ATLAS detector

    No full text
    Measurements are made of differential cross-sections of highly boosted pair-produced top quarks as a function of top-quark and ttˉt\bar{t} system kinematic observables using proton--proton collisions at a center-of-mass energy of s=13\sqrt{s} = 13 TeV. The data set corresponds to an integrated luminosity of 36.136.1 fb1^{-1}, recorded in 2015 and 2016 with the ATLAS detector at the CERN Large Hadron Collider. Events with two large-radius jets in the final state, one with transverse momentum pT>500p_{\rm T} > 500 GeV and a second with pT>350p_{\rm T}>350 GeV, are used for the measurement. The top-quark candidates are separated from the multijet background using jet substructure information and association with a bb-tagged jet. The measured spectra are corrected for detector effects to a particle-level fiducial phase space and a parton-level limited phase space, and are compared to several Monte Carlo simulations by means of calculated χ2\chi^2 values. The cross-section for ttˉt\bar{t} production in the fiducial phase-space region is 292±7 (stat)±76(syst)292 \pm 7 \ \rm{(stat)} \pm 76 \rm{(syst)} fb, to be compared to the theoretical prediction of 384±36384 \pm 36 fb

    Measurements of ttˉt\bar{t} differential cross-sections of highly boosted top quarks decaying to all-hadronic final states in pppp collisions at s=13\sqrt{s}=13\, TeV using the ATLAS detector

    No full text
    Measurements are made of differential cross-sections of highly boosted pair-produced top quarks as a function of top-quark and ttˉt\bar{t} system kinematic observables using proton--proton collisions at a center-of-mass energy of s=13\sqrt{s} = 13 TeV. The data set corresponds to an integrated luminosity of 36.136.1 fb1^{-1}, recorded in 2015 and 2016 with the ATLAS detector at the CERN Large Hadron Collider. Events with two large-radius jets in the final state, one with transverse momentum pT>500p_{\rm T} > 500 GeV and a second with pT>350p_{\rm T}>350 GeV, are used for the measurement. The top-quark candidates are separated from the multijet background using jet substructure information and association with a bb-tagged jet. The measured spectra are corrected for detector effects to a particle-level fiducial phase space and a parton-level limited phase space, and are compared to several Monte Carlo simulations by means of calculated χ2\chi^2 values. The cross-section for ttˉt\bar{t} production in the fiducial phase-space region is 292±7 (stat)±76(syst)292 \pm 7 \ \rm{(stat)} \pm 76 \rm{(syst)} fb, to be compared to the theoretical prediction of 384±36384 \pm 36 fb
    corecore